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METHOD AND APPARATUS FOR REUSING 
NON-ERASABLE MEMORY MEDIA 
The invention relates generally to a method 
and apparatus for writing in data storage systems and in 
5 particular to a method and apparatus for writing in a 
non-erasable storage medLum : such. aa a.. dlgitai- optical 
disk or paper tape. 

Backc^round of the Invention 

Non-erasable # or write-once ^ data storage 

10 media such as digital optical disks, paper tape, PROMS, 
etc.r have a plurality of bit positions, each of which 
can be irreversibly changed from an original state to a 
new state but one time. Thus, typically, the initial 
state of the bit position is designated a "zero" and the 

15 "zero" can be overwritten with a "one" when data is 

written into the media. Once the data has been written 
in a section of the media, which may be all of the 
media, that section is considered to be "used" by those 
experienced in the field, and is not considered to be 

20 reusable thereafter for recording new data. 

Some non-erasable media, notably digital 
optlcal.disksr can store -vvast amounts . of -data* A single 
twelve-inch disk can be used to store over 10^ bits of 
data, the equivalent of forty reels of magnetic tape, 

25 and access can be provided to any of it in about 

one-tenth of a second. This is an order of magnitude 
improvement in the cost/performance of memory technology 
and has had dramatic effects. And, while the cost of a 
high capacity, single twelve-inch disk may be only $100, 

30 that high capacity and low cost are achieved only at the 
further philosophical cost of making the writing process 
irreversible. 
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Similar limits exist, and are probably more 
familiar, in connection with the non-erasable media, 
such as punched paper tape, punched cards, and 
programmable read only memories (PROMS). However # the 
5 tremendous capacity and low cost per bit of digital 
c^tlcal disks pvayihSss^ s: stroxtg^ nto t i. vat ion far: nBore : ^ 
closely examining their one drawback, the non-erasable 
nature of the memory storage process • 

Therefore a primary object of this invention 
10 is an apparatus and method for reusing, that is writing 
more than once in, non— erasable memories. Other objects 
of the invention are an apparatus and method for 
reliably and simply increasing the effective capacity of 
non-erasable memories, and for providing a simple, 
15 easily implemented apparatus and method for rewriting in 
an otherwise non-erasable memory. 

Summary of the Invention 

In one aspect, the invention relates to 
apparatus for reusing a non— erasable memory medium. The 

20 apparatus features reading circuitry for reading. ; 
successive groups of digits. from the nozr-erasable memory 
medium and a process:: el emenfc fomwritlng over each o£:; 
the successive groups In accordance with both the input 
data information read at each successive group and 

25 respective new incoming input-data information. Thus, 
the process element provides an output in accordance 
with a predetermined "mapping" procedure or plan which 
.is determined by the two data information sources 
recited above. Thereafter, the "written over" groups of 

30 digits can be uniquely read, at a later time, to 
reproduce the just written new incoming input-data 
information. 
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In another aspect, the invention relates to 
apparatus for writing a plurality of times in a 
non-erasable memory. The invention features receiving 
circuitry for receiving incoming digital data and for 

5 forming sequential input-data groups from the received 
d&t&. A reading element reads successive SEQups- of 
digital data from successive groups of data positions in 
the non-erasable memory. In response to each formed 
sequential input-data group and a respective read 

10 successive data group from memory, mapping circuitry 
generates successive new coded groups of digital data. 
Writing circuitry sequentially writes each of the coded 
groups of digital data over the corresponding read data 
group. Thereby, the coded groups of data can be later 

15 read and uniquely decoded to generate the previously 
received incoming digital data. Depending upon the 
coding or mapping method, the non-erasable medium can be 
rewritten a substantial number of times. 



20 



In yet another aspect, the invention relates 
to a method for writing a plurality of times in a 
noar-erasable memory. The method features the steps of 
receiving input digital data, grouping the received, , 
input data into sequential- groups at data, reading front . 
the memory successive groups of stored digital- data, 
25 forming new successive groups of write digital data from- 
the stored digital data and the input data sequential 
groups, and writing the new groups of data over 
corresponding ones of the read groups from memory in a 
one-to-one relationship. The new groups of data are 
30 uniquely decodable to generate the input digital data. 



In yet another aspect, the invention relates 
to a method for reusing a non-erasable memory medium. 
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This method features the steps of reading successive 
groups of digits from the non-erasable memory medium, 
and thereafter writing over each successive group in 
accordance with (a) the data values of new incoming data 

5 information and (b) the just read data information 

written at the sxxaaessisn^ poslti ons ta be^^overwiclttea* 
The new data written into the medium is generated 
according to a predetermined mapping method of said data 
inputs* Thereby r the written-over groups can be 

10 uniquely read at a later time to reproduce the incoming 
data information. 

In another aspect of the invention, the 
apparatus relates to writing a plurality of times at the 
same group of bit positions (WITS) of a non-erasable 

15 digital memory. The apparatus features reading 
circuitry for reading the data stored at the bit 
positions and input circuitry for receiving new input 
data to be retrievably stored at the bit positions. A 
mapping circuit is responsive to the reading circuit and 

20 the input receiving circuit for generating a storeable 
codeword according to a.. mapping method, the newly 
generai^ted codeword being greater than or equal to (a^ is 
defined hereafter) the, word previously stored: at rttie bit- 
positions. The storeable codeword is then written, by a 

25 writing- circuit, at the bit positions. 

In particular embodiments of the invention, 
the mapping method can operate either according to a 
linear womcode or a tabular womcode as those coding 
methods are described hereinafter. In one particular 
30 embodiment of the invention, the mapping circuit employs 
a read only memory (ROM). 



wo 83/03912 



PCT/US82^00564 



-5- 

The method of the invention relates to writing 
a plurality of times at the same group of bit positions 
of a non-erasable digital memory. The method features 
the steps of reading the data stored at the bit 

5 positions and receiving input data to be retrievably 
stored at the hit. positions • The metHod.. further . 
features generating a storeable codeword in response to 
the read and received data, the storeable codeword being 
greater than or equal to the word previously stored at 

10 the memory bit positions. The method then features 

writing the storeable codeword at the bit positions, in 
effect overwriting the previously stored data. 

Brief Description of the Drawings 

Other objects, features, and advantages of the 
15 invention will become apparent from the following 

description of preferred particular embodiments of the 
invention taken together with the drawings in which: 

Figure 1 represents a simple diagrammatic 
representation of a method for rewriting in a 
20 noixr^&rasable memory; 



Figure. 2 is a., table describing tiie mapping v , 
method used in Figure, Ir 

Figure 3 is a . schematic block diagram of 
apparatus for rewriting in a non-erasable memory; 

25 Figures 4A-4B are more detailed 

representations of a mapping element of Figure 2; and 

Figure 5 is a table showing the optimum word 
length requirements for various groupings of word length 
in rewriting a non-erasable memory. 
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Description of Particular Preferred Embodiments 

Non-erasable memories , such as digital optical 
disks, are typically viewed as a write-once memory 
wherein the capacity of the disk is equal to the number 

5 of positions on the disk at which a data bit can be 
stoKd. Such th f nking Jiowevec. dops take intn : 

account the ability to neuse a disk, in the manner set 
forth below, which, in effect, increases the capacity of 
the disk substantially. The magnitude of the 

10 improvement varies depending upon the particular mapping 
method employed for reusing the disk and whether or not 
the memory being "reused" was originally written with 
that intent. As described below, very simple methods 
and apparatus can be employed to increase the capacity 

15 of a non-erasable medium by a factor of one- third. More 
complex methods and apparatus can be employed to obtain 
even more dramatic results. 

A non-erasable memory or "write— once memory" 
is designated a "WOH"« Each WOH consists of a number of 

20 "write-once bit positions" (called "WITS")j and each WIT . 
initiaily contains a •Lzero" that can later be 
selectively, but irreversibly, overwritten with a "oiie'^. 
T^picaily, a large WOH can be divided into data „ blocks:., 
or sections that are used as needed, an index being 

25 provided for exanple on an associated magnetic disk, to 
keep track of the valid blocks. The magnetic disk can 
be eliminated by using a linked-list or tree-like data 
structure on the WOM itself to. link obsolete blocks to 
their replacements; however, the effect of storing 

30 information on the WOH itself is to increase the cost in 
terms of access time. 

By using mapping or coding methods according 
to the invention, a WOH can be "rewritten" many times 
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and correspondingly it will have a "bit capacity" which 
is much greater than the number of its bit positions or 
WITS* Many of the coding techniques which are disclosed 
later are both simple to implement and can have a 
5 significant impact on the effective cost of using the 

Before describing the invention in more 
detail, a simple, but very effective, example can be 
employed to provide the basic understanding of the 

10 method for increasing the medium capacity, that is, for 
"reusing" or "rewriting" a non-ersable memory. 
Referring to Figures 1 and 2, as an example of a coding 
method to which the present invention is directed, 
consider a method which employs groups of three WITS or 

15 bit positions on the WOM to write two bits of data 

information twice. That is, by writing once, and then- 
again, in three bit positions of a non-erasable media, 
one can represent first, the data of a first group of 
two bits (written once) and, at a later time, the data 

20 of a second group of two bits (overwritten upon the same 
three bit positions used^ to represent the first group of 
two bits)* ^ 

Referring in particular . to the table of . Figure 
2, the initial two-bit input data value, "x^r is written 

25 the first time as a three bit position - space (three 

WITS) "y" according to a function y = r{x). Thereafter, 
data XI previously written as yi =» r(xi), can be 
"rewritten" in accordance with new input data 
X2 according to the function y2 = r' (X2) for X2 xi; 

30 and y2 yi for X2 = xi (that is, if the new data X2 is 
equal to the old data x^, the pattern yi is not changed 
and y2 = yi)- Note that r' (X2) will have ones wherever 
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r(xi) does (for any X2=xi) so that in going from r to r* 
one need only change zeros to ones* 

The Boolean three-cube of Figure 1, wherein 
the lower level o£. the cube represents the first 
5 generation of "writingr* and ttte t^J^ar level of:. tlieJc^ 
represents the second generation of "rewriting", 
graphically describes the writing sequence. 

Decoding of the data is quite easy. The 
memory word always represents the two-bit value (b+c) g 
10 (a+c) r independent of whether the WOM has been written 
once or twice. The EXCLUSIVE OR Boolean operation 
represented by the sign is a standard Boolean 

operation r is easily implemented , and is well-known to 
those skilled in the art. 

15 According to the method described in 

connection with Figures 1 and 2, four data bits can be 
written in three WITS of the WOM, Thus the method 
described above increases the WOH's capacity by a factor 
of one-third. 

20 Having introduced the concept:: of the method. ^ 

and. apparatus # it is helpful^ to introduce some notatloa. 
which makes the following more general discussion 
easier. The "weight" of a binary code word is defined 
to be the number of ones it contains. "x+y" will be 

25 used to denote the bit-wise EXCLUSIVE OR of the bit 

vectors "x" and "y" (which are both assumed to have the 
same length). A binary word x = xi.^.Xr is said to be 
"above" another binary word y « • •Ys (denoted "x 
greater than or equal y" ) r if r ~ s and Xi is greater 

30 than or equal *to yi for all i between 1 and r. All 
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logs, that is log(x), denote the logarithm to the base 2 
of X and the integer value of the logarithm is written, 
log ( X ) . 

A mapping method which uses n WITS to 
5 represent one of v values so ttrat it: can be written, a ,, 
total number of t times (that is, written once and 
changed t-1 times) is denoted as " (v) t/n-womcode". The 
"/n" can be dropped for an optimal womcode, i.e. one 
with minimal n. The general case, where the number of 
10 values can change from generation to generation/ is 
denoted by [v^,. . . ,vt]/n-womcode. The notation 
wC[vi.. .vtl) denotes the least n for which a 
l^j^f^fVi^i/n-womcoae exists. Similary, w([v]t) denotes 
the number of WITS needed for an optimal [v] * womcode. 

15" It may seem paradoxical at first that w([v]t), 

less than log(v)«t could exist. Intuitively the reason 
is that only the last value written needs to be 
accessible, that is, previous values may become 
irretrievable. In fact, if all previous values were 

20 always accessible, tben^the number of, WITS required , 
would. - be log ( V ) -t. - 

Referring to. Figure 3, an apparatus. -foar:.; 
imp lemen ting the various" mapping methods to be described 
herein "has an input data group formation circuit element 
10 for receiving, either serially or in parallel, input 
data on lines 12. The data group formation element can 
be, for example, a shift register designed to convert 
serial input data into input data groups of length k. 
The thus formed input data groups are provided over 
lines 14 to a mapping or coding circuit element 16. 
Element 16 can be, for example, a read only memory (ROM) 



25. 

-9 

30 
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(Fig 4A) whose output provides both decoded data over 
lines 18 and coded data groups over lines 20. The input 
to the mapping element 16 must include not only the data 
groups over lines 14 but code words previously written 

5 Cif any) which are provided from a media read circuit 

element 22 over^ litres Z4» v. The^^ncedla. readt ieXeinest 2Z^.^. , 
reads data groups £rasr a non->erasable medium 28 • The 
media read element 22 operates in a clocked synchronism 
with a media write circuit element 26. The media write 

10 element 26 takes the coded groups of data available on 
lines 20 and writes them on the non-erasable medium 28, 
for example a digital optical disk. The synchronization 
between read element 22 and write element 26 f through 
the coding or mapping element 16 r enables the write 

15 element to overwrite (or rewrite) at the precise 

position of previously read WITS (corresponding to the 
input to the element 16 over lines 24) so that 
previously written data is overwritten in a known and 
predetermined manner* 

20 As noted above # the coding element 16 can be 

far: example a read onl^ memory (ROM) 30 XFig. 4A). ^ The 
read only meceary takie? asT its vinputs^ the input data, 
groups on lines 14 and tbef: react datargraups aa;^lines..24» - 
These data inputs effectively address a jBord.of the BOH.. 

ZS and p r o vide at the output of the HDFI 30, a mult ibit word 
having two parts. One party over lines 18, represents 
the decoded data corresponding solely to the media read 
input over lines 24. The other part, over lines 20, 
represents the newly coded data corresponding to both 

30 the read inputs over lines 24 and the new data inputs 
over lines 14. Referring to Figure 4B, a typical ROM 
map is shown for the mapping sequence and method 
illustrated in Figures 1 and 2. (Alternatively, other 
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hardware logic, well known in the art, for example a 
discrete component Boolean logic decoding circuit, can 
be employed to encode the input signals over lines 14 
and 24 and to provide the required decoded and newly 
5 ' encoded output signals) . 

Referring to Figure 4B, it is interesting to 
note that when an already overwritten three-VJIT data 
word is read, and because no further writing is 

10 permitted, the output over lines 20 can be any 

convenient value. This flexibility can be employed to 
reduce the size of the ROM by combining a smaller 
capacity ROM with various logic elements to provide the 
necessary mapping called for by the method represented 

15 in Figures 1 and 2. 

While completely general observations 
regarding optimal coding methods have not beem 
completely worked out, the table of Figure 5 provides 
some available information regarding the minimal size 
20 for w([2Jc]t). Obviously, v(l2^}^) equals k and w( [21] 
equals, t. The remaining numbers have been derived 
eithec.hy hand or using-^ automated methods. . 

Attached to the application as Appendix 1 is a 
copy of a paper- -prepared by the inventors herein which ~ - . 

25 describes in great detail various aspects of the reuse 
of a non-erasable memory. That paper details much of 
the theory which has been employed in determining the 
advantages of the invention and in determining several 
methods for generating mapping systems to enable 

30 efficient transformations to be derived. The 

substantive work described in that paper, with the 
exception of that described in Section VI, is the joint 
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work of the inventors herein although editorial comments 
regarding the paper have been received from other 
sources. 

Examples of Other Methods for Forming Womcodes 
5 The- Tabular [vl ^/ir-W< jmcocFe r 

It is first assumed that t is greater than v* 
Further # let u denote an integer parameter to be chosen 
later (u to be chosen to be about equal to log(n))» The 
[v] t/n-womcode will have its n = r»s WITS considered as 

10 r = {u+l)(v-l) rows of s - log( v)+t/{u* (v-1) ) columns. 
Each row is divided into a log(v)-WIT "header" field and 
an (s-log{v) )-WIT "count" field. The log(v)-bit value 
represented by such a table is the sum (mod v) of the 
header fields of all rows that have an odd number of 

15 "l"s in their count fields. 

Thus, to write a value x when the table 
currently represents y, .it suffices to change a single 
"0" to a "1" in the count field of a row having the 
header^: x-y{ mod v). If ^ every row with this header -.has 

20 all. ones, in its count afield, we find new row which is 
currently all zeros both in its header^ and count:: fields.,, 
chaage the header to the des i red >value , . . and change one 
bit of the count field. "We can. always find a new row up 
until_u(v-l) rows are completely "full", since there are 

25 only v— 1 useful header values (the all zero value is 

useless as a header). Thus, we are guaranteed at least 
u( v-1) • (s-^log.(v> ) » t writes. 

Since n = r*s « t+t/u+log( v) (u+1) ( v-1) , 
choosing u equal to the integer value of log{t) implies 
30 n= t+p ( t ) . _ Th is code has a rate approx ima tely log ( v ) 
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which is less than log(n). The rate is the capacity per 
WIT of the womcode. With optimally chosen parameters, 
this code has a rate nearly equal to log(n) and is about 
twice as good as any other code described herein. 

5... The "Linear" Womcode 

This method enqploys the parameters v, t 
l+v/4r and n = v-1. The ith WIT is associated with the 
number i for i between 1 and v-1. The value ifepresented 
by any pattern is the sum (mod v) of the numbers 

10 associated with the WITS in the one state. {As an 

alternate definition, useful when v equals a power of 
2 , the pattern is interpreted as the EXCLUSIVE OR of 
the k-bit representations of the ntimbers associated with 
the WITS in the "1" state. For example, the [22] 2/3 

15 womcode of example 1 is a Linear womcode, and the coded 
pattern abc can be decoded as 01«a+10-b+ll-c. ) 

According to this code, so long as there are 
at least v/2 zeros, the W can be changed to represent a 
new value by changing at most two WITS. Thus, the 
20 mapfiing described above, has a rate roughly equal to 

log(v)/4 which is approjcimately equal.,to .log(n)/4- . i ; 

A [22]5/7 Womcode ; 

A [2^}^/7 (a rate equal to 1.42 approximately) 
womcode has been developed by an ad hoc method. The 

25 pattern is denoted as a seven WIT word abcdefg. If the 
pattern has a weight equal to four or less, the value 
represented is 01»coi+10*cio+ll'cii, where cqi = 1 if 
and only if ab equals 10 or (ab = 11 and one of cd or ef 
is 01), cio = 1 if and only if cd = 10 or (cd = 11 and 

30 one of ab or ef is 01) , and en = 1 if and only if ef = 
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10 or (ef = 11 and one of ab or cd is 01) • For example/ 
the pattern 1101100 represents 10. At most, one of ab# 
cd, or ef will be 11 if another is 01. Otherwise, the 
pattern is interpreted as the EXCLUSIVE OR of ab, cd, 
5 ef , and gg. The first three writes change at most one 
WTE: while the Isist tara writes may :^tai^ 

Additions, subtractions, deletions, and other 
modifications of the disclosed particular embodiments of - 
the invention, including new coding or mapping methodsy 
10 will be obvious to those practiced in the art and are 
within the scope of the following claims-. 
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HOW TO REUSE A "WRITE-ONCE" MEMORY 
(Preliminary Version) 
Ronald L. Rives t 
MIT -Laboratory for Computer Sciencer Cambridge^ Mass. 

Adi Shamir 

Wfeizmann Institute o£ Science, : Kehovot.r Israel 

Abstract 

Storage media such as digital optical disks, 
PROMS, or paper tape consist of a number of -write-once" 
bit positions (WITS); each wit initially contains a "0" 
that may later be irreversibly overwritten with a "1". 
We demonstrate that such "write-once memories* (WOMS) 
can be "rewritten- to a surprising degree. Por example, 
only 3 wits suffice to represent any 2-bit value in a 
way that can later be updated to represent any other 
2-bit value. Por large k, 1.29...-k wits suffice to 
represent a k-bit value in a way that can be similarly 
updated. Most surprising, allowing t writes of a k-bit 
value requires only t+o(t) wits, for any fixed k. For 
fixed fej approximately Tc't/loqU) wits are required as . 

1- >oo. >An n-wit WOM is shown to have a "capacity" ; : 

(i.e. ik-t when writing a k-bit value .trtinies) of tqi tor 

n-log(n) bits. 
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I . I n t r od uc t ion 

Digi1:al optical disks (a variation of the 
"video disks" used to store analog video data) are an 
excijting new storage medium. A single 12-inch disk 
costing $100 can be used to store over 10-''^ bits of data 
~ the equivalent of 40 reels of. magnetic, tape — ; a nd to 
provide access to any of it in 1/10 second. Such an 
order-of-magnitude improvement in the cost/performance 
of memory technology can have dramatic effects. (See 
[Bu80] , tHcSl] r lGo821 ) 

floweverr such capability is achieved at the 
cost of making the writing process irreversible. The 
disks are used as follows. Bach disk is manufactured 
with a thin reflective coating of tellurium. To write 
on the disk, a laser is used to melt submicron pits in 
the tellurium at specified positions, changing those 
positions from their virgin "0" state to a "1" state. 
To read the disk, the laser (at low power) illuminates 
each position on the disk; the lower reflectivity of the 
pits is easily sensed. 

The tremendous capacities and cheap cost per 
bit of /^gital optical disks provides a strong;: 
motivation to examine closely their one drawback —-their ; 
•'write-once" nature The . purpose of this paper is : thus rv 
to explore the true capabilities of such "write— once 
memories" (or woms). Other familiar examples of woms 
are. punched paper tape , punched cards , and PROMS 
(programmable read-only memories - in which the wits are 
microscopic fuses that can be selectively blown) . 

Large woms might naturally be used to store 
data that is more or less static: programs, documents. 
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pictures, data bases, or archival storage dxamps. If 
the data requires updating r the worn can be replaced by a 
freshly written worn. A large wom can be divided into 
blocks that are used up as needed? a index on an 
associated magnetic disk can keep track of the valid 
hlxxcdLS* The magnetic disk can he. eXimina^ted by using 
linked-list or tree-like data structures on the wom^ 
itself to link obsolete blocks to their replacements, at 
some cost in terms of access time. 

More formally, we model a wom as an array of 
"write-once bits" (or wits) which are manufactured in a 
"0" state but which can later be independently but 
irreversibly transformed into a "l" state. (We 
understand that some of the current recorder/player 
designs for digital optical disks are not capable of 
selectively changing individual zero bits within a 
previously written block. However # this seems to be 
more a matter of engineering than of fundamentals.) 

The main result of this paper is that by using 
appropriate coding techniques r a wom can be "rewritten" 
many .^imes, and that its "bit-^rapacitgr" is much greater ; 
than: the • number of its wits* Many of . the coding 
techniques proposed here are simple to implement r and 
can have, a significant impact; on the .cost of using woms...,. 

As an example of the kind of behavior we are 
interested in, the following coding scheme was a prime 
"movating example" for this research. 

Lemma 1. Only 3 wits are needed to "write 2 bits 
twice". 

Proof: We show how to represent a 2-bit value x in 3 
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wits so that it can later be changed to represent any 
other 2-bit value y. Firsts represent x with the 
pattern r(x) given in Table !• Later, a value y(y ^ x) 
can be written by changing the pattern r'(y)* (If x - y 
n.o change is made). Qbsexve that r*{y) will have ones 
wherever r(x) doesr so: that we iTee<J only change zeros to • 
ones. 



X 


r(x) 


r' (X) 


00 


000 


111 


01 


100 


Oil 


10 


010 


101 


11 


001 


110 



Table 1. A (22)2/3^womcode 

Decoding is easy: the memory word abc 
represents the 2-bit value (b®c),(a®c), no matter 
whether the worn has been written once or twice. 




2nd 

gener'atijon 



1st 

generation 



Figure 1* The (2 ) /3-womcode on the Boolean 3 -cube 
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II. Notation 

Let weight of a binary codeword be the number 
of ones it contains. Let x ®y denote the bitwise XOR 
of the bit vectors x and y (assumed to have the same 
length). We say that an binary word x = xi...Xr is 
•above" another binary word y = yi»..ys (denoted x >. y). 
if r = s and Xi > yi for 1 < i < r. Let log(x) denote 
the logarithm (base 2) of x (or, if the context requires 
an integer value, [log2(x)l). We use Zv to denote the 
set (0,1,. ..V —1) , and to denote the set of all 
binary words of length n. We say "f(n)a? g(n)" if 
limn^^oo f (n)/g(n) = 1. (The variable being taken to 
the limit should be clear from the context.) We also 
let H(p) denote the "entropy function" H(p) = plogd/p) 
+ (l-p)log(l/(l-p)). 

A coding scheme that uses n wits to represent 

one of V values so that it can be written a total of t 

times (i.e. written once and changed t - 1 times) we 

call a " [v] Vn-womcode" (read: a "v t-times into n-wit 

womcode"). The "/n" may be dropped for an optimal 

wom-code (with minimal n) .{reads an "optimal v t-times „ . 

wOTicode"). The general xase - ..where the number of , 
values may differ from generation ;to generation - we 

call. a "ivi,... ,vtl/n-womcode" (read: a "vi to v^ into, 
n-wit womcode"); here the value stored on tbe irth write 
may be any one of [0 , . • • ,Vj^— 1] . 

Let w( Ivi,. . . ,vt] ) denote the least n for. 
which a [vi,...,vtl/n-womcode exists. Similarly, we use 
w([vlt) to denote the number of wits needed by an 
optimal [v]t womcode. We are interested in 
characterizing the behavior of w([v]t) for large values 
of V or t as well as finding "practical" coding schemes 
for small values of v or t. 
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It seems at first paradoxical that w([v]t) < 
log{v)*t could happen. Intuitively r the reason is that 
only the last value written needs to be accessible - 
previous values may become irretrievable. In fact, if 
all previously written values were always accessible 
tfaeii::liag(v)* t wits :WOiiIjd be. required.* 

To make our definition of a womcode precise r 
we note that a {vj^ , . . . ^v^ )/n womcode can be defined to 
consist of the following parts (here let v = 
max(vir • • • rVt) ) : 

(1) An interpretation function that maps 
each X e Z 2 a value a (x) ±n 2^, 

(2) An update functions which gives for any x 
. and for any value y in Z^r either "j^" 

(i.e. undefined), or else a codeword 
y (Xry) - z e Z2 such that a (z) « y and 
z >^ X. 

We say that a and. ^ define a correct 

[virv»*^vti/n-womcode if tliey •gueirantee -at least .fci. : 
writes^* as A defined below, : lii order to define tiiis:- 
condition we first make /the following auxiliary . , i 
definitions. ^ ^ 

An acceptable sequence tiir*..,ijn] for a 
t^l* • • fVtl/n-womcode satisfies the conditions that 0 <^ m 
j< t and each ij^ 1 <^ j <^ m^ is in Z^j • Note that in 
particular the null sequence X is acceptable. 

We define the "write function" P mapping 
acceptable sequences to codewords (orJ_) as the 
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iteration of y starting from the all-zero word 

on (corresponding to the "initial state" of the worn) by 

the following equations: 

f^QTi if j = 0 

(We assume that y (J. ,y) - ifor all y. ) We say that an 
acceptable sequence "arrives at x" if that sequence is 
mapped by p to x. 



y ( p (tii,...ij-ll)fij)» if 3 >.I- 



We say that a codeword x is "used in 
generation m," or "is an roth generation codeword" if 
there is an acceptable sequence of length m that arrives 
at X. A codeword x is said to be "unused" if no 
acceptable sequence of positive length arrives at x, 
otherwise we say x is "used". 

If every codeword belongs to at most one 
generation we call the womcode "synchronized" - since 
all. acceptable sequences , that arrive at a codeword word 
arrive there "at the same time* (i.e. at the same, 
generation) • Otherwise the womcode is called 
•unsynchronized. " With a synchronized womcode one can 
always determine how many generations have been- written.; 
Note that our [22] 2/3 womcode is not synchronized since. 
000 belongs to the zeroth, first, and second 
generations. We say that a womcode is "almost 
synchronized" if the all-zero word is the only codeword, 
that belongs to more than one generation, and it belongs 
only to the zeroth and first generations. 

The laminar womcodes are an interesting 
special case of the synchronized womcodes: a womcode is 
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laminar if it is synchronized and the weight of every 
(used) codeword determines its generation. (That isr no 
two codewords of different generations have the same 
weight. ) 

We say tba^ tixe !KnECCQde .de£lj^^ 
"guarantees at least t writes" if no acceptable sequence 
of length t arrives at . This completes our formal 
definition of a [v^, . . • ,vt]/n-womcode defined by ci 
and V ; such a womcode is correct if it guarantees at 

least t writes. 

We will often identify an interpretation a(x) 
with its binary representation. (For example r in a 
[ 2^^] Vn-womcode each n-bit codeword represents a k-bit 
word.) 

We would like to note that we initially 

studied only the [2*^] ^/n-womcodesr but that we have 
since seen enough interesting examples of womcodes of 

the more general form to warrant including the more 
general . definition here.. 

We now ijitroducevour- ttree^ •cal^Erlexity ^ ' 
measures": . Pr If and C. 

Let P(t) denote the "penalty expansion factor" 
needed to guarantee t writes of values from Z^, for 
large v, compared to that needed for just a single 
write: 

w( [v] t) 

P(t) = lim (1) 

V — >oo log(v) 
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We will prove that P(2) = 1.29... and P(t) a^t/logCt). 

Let I(v) denote the asymptotic "incremental 
cost' of increasing t by one for a [vl t-womcode: 

I(v) = lim (2) 

t — ^oo t 

We shall prove the surprising result that I(v) = 1. 

We define the apparent capacity <in bits) of a 
E^l.'v-'Vtl/n-womcode to be log(vi. . .vt) ; we denote this 
as C( [vi,.,.vtl/n). ^ Similarly, let C(n) denote the 
apparent capacity (in bits) of an n-wit memory fields 

C(n) = max {log(vx...vt)|w((vi,...,vt]) < n} , (3) 

We shall demonstrate that C(n) = n-log(n)+o(n'log(n) ) . 
As an auxiliary definition, we let R( [v^, , . .v^l /n) « 
C( [VI,. . .vtl/n)/n denote the rate of the womcode. (This 
is just-vthe capacity per, wit, of the womcode.) 

rii; El^entary Observations ; 
Lemma 2. . . 

w(tvi-V2]^) < w{ [vil t)+w( [V2]*^> (4) 

Proof. Concatenate a [vi] *^-wonicode and a [V2J ^-wcancode . . 
to make a [v^. V<w( (viJ t) )+w( tV2) *) ) womcode. 
(Represent each value y in 2^.1 •v2 an ordered pair 
<yi'y2)' yie Zvi and y2 e 2^2- Use the womcodes to 

record yi and y2 separately). 
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Lemma 3. 



(w([v]t) is subadditive in ) 



w( [V] ^1+^2) < w( [vl*!) +w([v]t2) 



(5) 



Proof. Use side-by-side optimal [vl«^-and Ivl^2- 
womcodes to represent the sura (mod v) of the values 
represented by the two subcodes. To update, change one 
subcode to represent the difference {mod v) of the new 
value to be represented and the value of the other sub- 
code. This guarantees at least ti+t2 writes. (The 
alternative approach of writing the new value into one 
of the two subcodes would need extra wits to indicate 
which subcode was written last, unless the zero word is 
unused in one of the subcodes. ) 



w( 12^}^) < k*t. 

For small values of k and t, we can derive 
wU2klt) as given in Table 2. (Obviously, w([2l^ll) = k 
and-w« tzl]^) » t*) 

t 

1 2 3 4 5 6 ^ 



1 


1 


2 


3 


2 


2 


3 


5 


3 


3 


5 


7 


4 


4 


6 




5 


I 5 


8 




6 


1 6 


9 




7 


1 7 







The above lemmas (and w{ t2-^l-*' ) = 1) imply that 



Table 2. 



W([2'^l*) 
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We do not know the exact values corresponding 
to the empty positions of the table. 

The [23] 3/7 <rate 1.28...) and [22)2/3 (rate 
1.33...) womcodes indicated by the table are special 
cases of the general "linear" scheme presented in 
section V. 

The [22j5/7 (rate 1.42...) womcode indicated 
by the table is an ad hoc scheme? we show here how to 
decode a. 7-wit pattern abcdefg. If the pattern has 
weight four or less, the value represented is 
01.coi©10-cio©ll-cii, where coi= 1 aft ab = 10 or (ab = 
11 and one of cd or ef is 01), cio = 1 itf cd = 10 or (cd 
= 11 and one of ab or ef is 01) , and Cn = 1 ii£ ef = 10 
or (ef = 11 and one of ab or cd is 01). (For example, 
the pattern 1101100 represents 10. At most one of ab, 
cd, ef will be 11 if another is 01.) Otherwise the 
interpretation is ab & cd © ef ® gg. The first three 
writes ^change at most one ij^itV while, the Oast two might 
each change two wits.^ , 

The following notation for the size of the- - 
tail. of a binomial distribution and a related inverse 
quantity will be useful: 



(O I Q 



(6) 



6 (v,m) = min {h | 




> v>. 



(7) 
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Note that a [v] "t/n-womcode must have n >^ m + 6(v,m) if 
every first generation codeword must have at least m 
zeros* We derive a lower bound 2(Vrt) to w{[vlt) by 
generalizing this observation: 

2(VrO) « 0 andv (8) 

Z(v,t + 1) = Z{Vrt) + 6(v,Z(v^t)) for t >^ 0. (9) 

Lemma 4. 

w( [v] t) >^ Z(v,t) (10) 

Proof. By induction on t. The case t = 0 is trivial. 
A [v] t+l/n-womcode must have at least Z(Vjt) zeros in 
every first generation codeword, and must turn on at 
least 6 (v,Z(v,t)) wits in the worst case on the first 
write to have v codewords in the first generation. 

Corollary. 

w( 12^}^) > k + t - 1 (11) 

Note-axat:Z{2*^,l) = k an* :Z(2?S t + D >. a(2^rt) + 1 for. 
k > 0* The following X emma tmpruve s th is : resu 1 1:, ( by ; : 
one) • . 

Lemma 5. 

w ( [2k] t) >^ k + t for k >^ 2 and t >^ 3. (12) 

Proof. Suppose to the contrary that a [2k]t/(]^ + t - 
l)-womcode existed for k 2l 2 and t >^ 3. Since Z(2^,l) = 
k, the generation t — 1 codewords must each have weight 
less than t. On the other hand, if t >^ 3 then for every 
value y e Z2k there is a t - 1-st generation codeword x 
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With a(x) = y and weight at least t - 1. (For t = 2 the 
claim fails if the zero-weight word is in the first 
generation.) There must be at least 2^^ - .1 > 3 
different values y associated with first-generation 
codewords of weight 1 or more. Thus for every value 
yeZ^It there is a t - X-stL generation, codeword, x .with, 

a(x) = y and weight exactly t - I- But then no 
possible interpretation for the codeword lk+t-1 is 
distinct from each of these values (required since the 
last k levels are -tight"). This contradiction proves 

the lemma* 

IV. HOW many wits are needed for a fixed number of 
generations? , . _ — _ 

IV. A. How many wits are needed for two generations? 

w( [vj2) e 1.293815. ..log(v) (13) 

Proof. For any v, choose h to be 6(v, log(v)) and then 
chose n to satisfy: 



n - h = riog(v) + log Ibg(v) + 1 -rlog log(e)l. (14) 

We Will Jprove that w([vl2) ^ n. Choose the first 
generation representations arbitrarily as distinct ' 
codewords with weight at most h, and ramdomly assign to 
the remaining 2n - v codewords interpretations from Zy. 
There are 

(v)2n-v (15) 

ways to do this. How many ways do not guarantee two 
writes? Such a bad assignment must contain a first 
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generation codeword x and a value y e Zy (a (x)} such 
that no codeword z >^ x represents y. If we select x in 
one of V ways, select y in less than v - 1. ways^ assign 
all codewords z >. x values different than a(x) and 
asalgn all other codewords arbitrary values, we will 
have overcounted ther: baS: codes but exanrined no:: gore .tiian 

v2.(v - 1)2^^^*^ . (v)2^v-2n«h^ (Ig, 

codes. Whenever (16) is less than (15) some "good" 
codes must exist* This happens when 




(17) 



which will happen if 

21og(v) < 2n-li-log(v> • log(e) (18) 

which is implied by 

. Tt^ tt:^ riog(v) + log log(v) -i-l - log logCe)]. :C19) 

Thus (19) 'implies the existence of a [vj */|x-womcode^ 
Since n >^ h + log(v) (from Lemma 4), we conclude that 
for an optimal [v] 2/n-womcode 

n - h -i- log(v> + o(log(v)). (20) 

Now the logarithm of the number of words of length n 
with at most h ones is 

n*H(h/n) + o(n)^ for h < n/2. (21) 
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(See IPW72, Appendix A], or IMS77, Ch. 10, SHI - ) Since 
there are t values in the first generation, 

n.H(h/n) + o(n) « log(v) (22) 

or , (since log(v) = q..- h. 4- Q(log(v>) and. n 2r.log(v) ) 



(23) 



The equation H (p) = 1 - p has a solution at p = 
0.22709219..., so for an optimal womcode h/n = .227..., 
or- log(v) s! n«(l - .227...) 
or 

n « 1. 29381537... iog(v) (24) 

which was to be proved. 

The random womcodes of the theorem will have 
an -asymptotic rate of 2/1.29... » 1.5458..., much better 
than tiie rate 1.33... womcode of lemma 1. However, we-^ ^ 
could : not - cons tract „ by., hand: a .125^1 2-vt3mcode of . rate . 
higher than 1.33.... Lemma 4 implies that such a scheme 
must have k = 7, n = 10 or k >^ 9- Using .a computer .we . 
found ja.. slightly more efficient method with rate 
1.34. • 

The new scheme is a [26] 2/7-womcode (rate = 
1,3429...). So a seven-track paper tape is "reusable" 
for writing just letters! Row i, column j of Table 3 
gives the value (a letter) of the 7-bit string with 
binary value i * 32 + j. The first-generation is in 
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upper case. Thus a "T" (0011000) is made into an "h" by 
changing bits 1, 2, and 5 (to obtain 1111100). We were 
unable to find a [271 2/7-woincode or to prove one doesn't 
exisjir although we can prove that a [29] 2/7 womcode 
doesn't exist. 

00000000001111111111222222222233 
01234567890123456789012345678901 



0 AHGGFYLwEZYrXfpnDWVzDdjoTwkeltdu 

1 CSRcQiozPpihuexyOzs jsniwvcqgfkbm 

2 BNMzLbgmKutbngfwJwrhkvxyrajpsoqci 

3 Ikmqlckuwteosdjvubdfgetpyxnlhrza 

Table 3. A ( [26] 2/7 ) -womcode 

IV. B. What is P(t)? 

By reasoning similar to that o£ the proof o£ 
Theorem 1, we derived the following estimates for P(t). 
Note how closely the estimates are to t/log(t}. 



t 


P(t) (est.) 


t/log(t) 


1 - 


1.000 




2. 


1.294^ 


. 2.000 . 


3 


1.549 


1.893 


4 


1.783 


2.000 


5 


2.003 


2.153 


10 


2.983 


3.010 


20 


4.668 


4.628 


50 


8.960 


8.859 


100 


15.191 


15.051 


200 


26.346 


26.164 


Table 4. 


P(t) (est.) vs. 


t/log(t) 
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To demonstrate our main result that 
P(t). * t/log(t), we define an upper bound to w([vlt) 
which is asymptotically equal to our lower bound Z{v,t) 
o£ Lemma 4, to withia a small additive term* 

Theorem 2. For fixed t and v sufficiently large, a 
sufficient condition for the existence of a 
[v] t/n-womcode is the existence of t numbers li, 1 £ i 
< t, such that 

(a) t-log(v) >. n >^ li >. l2 >. 2l ^t 2. °' 



(b) 

Ai \ 

(c) Vli+1/ > v-(t + l)-log(v), for 1 £ i <. t. 

Proof . 

We prove the existence of a [v] t/n womcode in 
which, all i-th generation codewords contain exactly Ij 
zeros. Condition (b) implies that there are enough 
codewords with 1-^ zeros for: the v values of the first ; , 
generation. We . now show (by a counting airgument) that 
for all i, 1 _< i < t it is. possible to assign 
interpretations to the codewards with li+i zeros for 
the i + 1-st generation in such a way that for every j, 
0 < j < V, every codeword with li zeros is below some 
codeword with li + i zeros that has been assigned 
interpretation j- 

The total number of ways in which the U-i+iy 
codewords with li + i zeros can assigned values is: 
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vdi+i). (25) 

We can overcount the number of "bad" ways of assigning 
interpretations to the codewords with li+i zeros (assum- 
Ingr w& have alreadjf assigned.' iztterpcetatdLons. . ta. .the.-, 
earlier generations)/ in the following way. Choose a 
codeword x with li zeros, choose a "missing value" yeZvr 
assign the ^i+i) codewords with li+i zerbs above x with 
the V - 1 remaining Values (other than y) , and assign 
the other codewords with li+i zeros arbitrary interpre- 
tations. The number of "bad" ways is thus at most: 



(liii) - (li+i) 

(v-l) - V (26) 



A "good" way must -exist whenever (26) is less than (25) 
By simplifying this inequality, we gets 



< 1. . ' (27) 



Siiice. n X flog(v) (othervlse the existence of the i 
desired womcode is trivial)* ("j^) <.y*r and. thus it is 
enough to prove r- 






t+i / \ \ii+i/ 

1 P ^ < 1. (28) 



By condition (c), Ui+l/ ^ * log(v) • (t + 1), and 
thus it suffices to prove that: 
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l\v • log(v) • (t+ 1) 



< 1. (29) 



Hufc for large enough Vr approaches snd., 

thus the left hand side is approximated bys 

vt+i . e-iog<v>-<^+i> ' 

which approaches zero as v goes to infinity. 

To find the smallest (or nearly smallest) n 
foe -which the existence of a [v] ^.womcode is guaranteed 
by the theorem, the numbers li should be chosen in 
reverse order (from 1^ to li). The last two numbers can 
be chosen as: 

log(v) 

1^ s + c log log(v), (31) 



where c is any constant greater than 1, and 

1^-1 .= log (v> ■»- 2c log log(v), . (12) 



sxnce 





V (log(v)2c) 



(33) 



log(v) + 2c log log(v). 



Since c > 1 and t is fixed, this becomes larger than v* 
log(v).(t + 1) for V sufficiently large. The other li's 
can chosen as the smallest numbers satisfying condition 
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(c) of the theorem. Finally # n can be chosen as the 

smallest number satisfying (l^j^) v. 

We now proceed to analyze the performance of 
the womcodes described above r in order to show that 
their performance is. . aaymptotically equal to that of .the 
lower" bound we prove<i- in Denmra 4. Then we pruve otnr.:. *\ 
main theorem (theorem 4) that P(t) ^ t/log(t>« 

We first introduce some necessary notation. 

Let 



/m+h\ 



« Mv,in) "ininChj [ 1 >. v } . (34) 



(Note tihe similarity to the definition of 5 in (7).} 
Then we define: 

log(v) 

Y^{v,0) = +c log log(v), (35) 



Yu<V'l) = log(v) + 2c log log(v), and 136) 

Tu(v,t X) = Y„^(v>t) + 5*(v«u-log(v>^ Y^(Vrt)), 

for t > 1. (37) 



For convenience in the next theorem, we define y(v,t) to 
be Yiog(v) note that for large v, Y(v,t) Yt+1 

(v,t). Prom this definition it follows that for v 
sufficiently large r 

w( [V] t) < y(v,t) , (38) 
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since li = Yt+i(v,t - i) f or 1 < i < t and n < Yt=l<'«''^> 
in the construction of last theorem. 

Theorem 3. For t >. 1» 

lim =1. <39). 

V — ^oo Z(v,t) 

Proof . 

By induction on t. The case t » 1 is trivial. 
By comparing the forms of the definitions of Y and Z, we 
see that it is enough to proves 

6* (v.{log(v) )2,m' ) 

li„ : = 1, (40) 

V — >oo 6{v,m) 

where m = Z(v,t - 1) and m' = y(v,t - 1). We observe 
that 

6(v,m) < fi'Cv.m) <.6(v,m) +1 (41) 

if jn >^log(v) (since that implies that6(v,m) <. (m/2)). 
(Note that in (40) both m and m* are-2L. lo9(v> • > ""^ 
can replace 6' by 6 in (40). Furthermore, 5(Vrm) is,a ?. 
decreasing function of m, ,so that, we . can also replace m' 
by the smaller value m in (40). In a similar vein, it 
is simple to show that m >^ log(v) implies that 

6(v (log(v) )2,ni) < i(v,m) + 2 log log(v), (42) 

and a little more complicated to show that log(v) £ m £ 
Y' log(v) inqplies that 
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6(v,in) >. log(v)-(2T -H-ld/ZY)) (43) 

Combining these observations leads to the desired 
result. 

Theorem 4. P(t) = t/logCt-V* 
Proof. 

Let n^. «» Z(v,t). Then we must haves 

nt 
nt - nt-i 

so we derive 

H(nt-i/ht) = log(v)/nf (45) 

We consider H(p) near p » 0 using the fact that H(p) = 
H(l - p)s 

1 - nt-i/nt« H-l(logCv)/nt)- (46) 

Near^ p =^ 0, H (p) = p« logCl/p) » so .HTL^ (y) = -y/log(y) r; 

nt-i — log(v)/nt 
1- a (47) 

nt log(log(v)/nt) 

**t ~ nt-l a - log{v)/log(log(v)/nt) (48) 
dn^ -log(v) 



(49) 



dt log(log(v)/nt) 
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log(nt/log(v) ) -dnt 

dt « 

log(v) 

t « ( -) log ( ) 

log(v) log-<v) 

= P(t) « (52) 

log(v) log(t). 

As a consequence of Theorem 4 / for fixed t and 
large v, an optimal Ivl* womcode will have a rate 
approximately equal to log(t), with the approximation 
improving as t increases. 

V. What is I(v)? 

In this section we demonstrate that I(v) = 1 

for any v, using a "tabular" womcode. We also present a 

"linear" womcode that - while it only shows that I(v) £ 

4, generalizes nicely our [22] 2/3_Voincode. 

V.a.. : The Tabular, [vl Vn-Womcode 

We assume here that. t > v. .Xet u.. denote an:, .. - 

integer parameter to be ..chosen . later, .(imagine that. u. is., 
about log(n}). Our [v] t/n-womcode will have its n = r-s 
wits considered as r = (u -i- l)(v - 1) rows of s = log{v) 
+ t/(u«(v - 1)) columns. Bach. row is divided into a 
log(v)-wit "header" field and an (s - log(v))-wit. 
"count" field. The log(v)-bit value represented by such 
a table is the sum (mod v) ) of the header fields of all 
rows that have an odd number of "l"s in their count 
fields. 



(50) 



. (51) 
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To write a value x when the table currently 
represents it suffices to change a single "O" to a 
"1" in a row with header x - y {mod v) • If every row 
with this header has all ones in its count field, we 
find' a new row which currently is all zeros both in its 
header and count fields,, change the header to the ^ 
desired value, and cSiange one bit of ttie camiL fi e l d^ ^l 
We can always find a new row up until u(v — 1) rows are 
completely "full**, since there are only v - 1 useful 
header values. (The all-zero value is useless.) Thus we 
are guaranteed at least u(v - 1) • (s - log(v)) = t 
writes. 

Since 

n = t + t/u + log(v)(u + l)(v - 1), 

by choosing u « llog(t)J implies that n « t + b(t)« 

This code has rate approximately log(v) < 
log(n). With optimally chosen parametersr this code has 
a rate nearly log(n)r about twice as good as any other 
code? presented in this paper^* r. 

V.B..- The "Linear" Womcode 

This scheme has parameters v^ t = 1 -J- v/4, and . 
n = V - 1. The i-th wit is associated with the number i# 
for 1 <^ i < V. The value represented by any pattern is 
the Sim (mod v) of the numbers associated with wits in 
the "1" state. (An alternative definition, useful when 
V = 2k, interprets the pattern as the XOR of the k-bit 
representations of the numbers associated with wits in 
the "1" state. For example, in our [2^] 2/3-.^omcoae the 
pattern abc can be decoded as 01 « a® 10 * b0 11 * c.) 
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We now show that - as long as there are at 
least v/2 zeros - we can change the worn to represent a 
new value by changing at most two wits. Let z denote 
the difference (modulo v) of the new value desired ^ Y/ 
and the current value represented r If the bit 

associated with z is now "Q" we can simply change it to 
a ''I''^ Otherwise let S denote the set of numbers^ 
associated with wits which are currently zero, and let T 
denote the set { z — x (mod v) | x e S}. Since | S | = j T j 
>^ v/2 and | SUTj < v - 1 (zero is in neither set)# the 
set SOT must be nonempty. Their overlap indicates a 
solution to the equation z = + X2 (mod v) where xi 
and X2 are elements of S. 

The code described above thus has rate roughly 
log(v)/4 « log(n)/4. 

The linear womcode described above may have to 
stop when there are as many as (v/2) - 1 zeros left 
(which can happen after as few as 1 + (v/4) writes). 
The following trick allows one to keep going a while 
longer. Divide the n-wit field into n/3 blocks of size . 
3.. Hy writing additionalJ''l*s if necessary, force each, 
block, to have either one ."I* or three "l^s exactly*; . 
Those "bad'' - blocks having no zeros remaining are now \ 
considered as deleted r while each of the remaining v,. 
"good" blocks can be used to store one bit (by changing 
one of its wits - the other one is left untouched to 
indicate that the block is a good block and not a 
deleted block). With at least {n-l)/2 zeros remaining 
we are guaranteed* 'of getting at least n/12 - 1 "good" 
blocks. The recurrence: t(n) = n/4 + t(n/12) has the 
solution t(n) = 3n/ll + 0(1) , Indicating that this trick 
can increase the number of writes we can achieve by a 
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factor of 12/11 (from n/4 to 3n/ll writes). At the 
moment this coding trick is also the best general scheme 
we know of for making use of a "dirty" WOM that may have 
been written before in an arbitrary manner (i.e. without 
any thought of using some sort of "womcoding" for better 
u±i 1 i zation) , 

VI. What is C(n)? 

The schemes presented in the last section can 
be used to show that C(n) • n»log(n) + o(n • log(n))« 

Theorem 5. 

C(n) >^ n log(n) + o(n log(n)) 

Proof. For a given large memory size n, we can use the 
"tabular" scheme of section V.A. and choose parameters: 
log(v) = llog(n) - 2 log log(n)J with r = I log log(n)J* 
(v - 1) rows of length s = In/rJ . (We will "waste" n - 
rs wits.) As before r the total number of writes possible 
is n - o(n), proving the theorem. 

It is also possible to show that this result . 
ia "^rest possible" r : 

Theoresr 

C(n) < n-iog(n) 

Proof. (Intuitively, changing one wit out of n should 
provide at most log(n) bits of information.) Consider 
any [V]^,. . . ^v^l/li-womcode. The n-wit field can undergo 
at most (t + 1)^ < n'^ "histories" as it progresses from 
its first state (.all "0"s) to its final state (perhaps 
all "l"s), since we can describe the history by 
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specifying for each of the n wits that it either always 
remains "0" or that it is turned to a "l" during one of 
the t write operations. On the other hand, the womcode 
has at least vi...vt different acceptable sequences of 
length t to handle, each of which must have its own 
history. The theorem follows. 

VI. Other Womcodes 

Several of our colleagues have become 
intrigued by the problem of designing high-rate 
womcodes r and have graciously consented to our sketching 
or referencing their preliminairy results here. 

Prof. David Klarner (Dept. Mathr SUNY, 
Binghampton) , has created an elegant [5] 3/5 
(rate 1.39...) cyclic womcode r which works as 
follows. The first value is represented by 
10000 in the first generation r either 01001 or 
00110 in the second generation, and one of 
01111, 10110, or 11001 in the third 
generation. The other four values are handled 
similarly, using cyclic rotations of the words 
given for the ficst value. (Since n is prime.., . 
all the . cry clic^ rotations arerdistinct. ) - : 

David Leavitt (an undergraduate working with .. 
Prof. Spyros Magliveras, Dept. Hath, Dniv. 
Nebraska at Lincoln) , has found an even more 
efficient [714/7 (rate 1.60...) cyclic womcode 
by extending Klarner' s technique. (To 
appear. 1 

James B. Saxe (a graduate student at CMU) has 
created the following beautiful (n/2,n/2 - 
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l,.*.fl]/n womcode (rate asymptotically 
log(n)/2)r where the two halves of each 
codeword are the same except that the left 

half has an extra "1" bit. The value 
represented is the number of zeros in the left 

half to the left of the extra bit. Each 
update (except tire fxrstr>r c hanges egractrly two r 
wits — one in each half. 

Saxe also suggested the following marvelous 
recursive woracoder which uses n = 2^ witsr and 
changes exactly one wit per write. Using fin) 
to denote the capacity of Saxe*s code, we 
shall see that fl2^) « k*2k-l, using the base 
case /(I) * 0, giving a rate o£ log(n)/2. 
Partition the n wits into n/2 pairs. With the 
first n/2 writes we turn on at most one bit in 
each pairr and obtain capacity /(n/2) + n/2 by 
using the code recursively on the n/2 pairs, 
and getting an extra bit/write using the 
parity of the number of pairs whose left bit 
is on. For the second n/2 writes we obtain 
capacity /(n/2) recursively on the pairs by 
turning on their' second.* bits as : needed. The . 
recurrence fin) — n/Z .+v:2/(n/2> : giveff rttxe , 
desired result* 

Saxe has also created a [65,81,631/12 (rate 
1.52...) womcode that can be improved to a 
[65,81,641/12 (rate 1.53...) womcode if the 
"generation number" is' externally available 
when decoding. 



VII. Discussions and Conclusions 

results p res ented in this paper provide 
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much information about the nature of the function 
w(tvlt). On the basis of the evidence so far, we 
conjecture that 

log(v)*t 

w([v]^) s max(tr "> 

log(t) 

for all large v and t. We expect that this result 
should follow in a more-or-less straightforward manner 
from the results and techniques given here^ but we have 
not as yet worked through a detailed demonstration. 
(Exercise for the readers prove that w([2k]k) » 
B(k2/iog(k)).) 

The relationship between worocodes and 
error-correcting codes are interesting s we can view a 
womcode as a situation where the channel is assymmetric 
(only 0 — ^1 errors occur) r and where the transmitter 
knows where the errors will occur before he has to 
choose a codeword. Of course r there are still many 
differences, since the objective of womcoding is to 
aJLlow many "messages" to. -.be.-sent and the codeword for 
one message determines what .the. •errors?, are for- the,,, 
next message. 

A more Closely related problem may be thatof"* 
devising codes for random- access memories that have 
"stuck bits". Heegard [He81] has some recent work in 
this area. Again, however/ the problem seems 
intrinsically different. 

Some interesting work has been done on Turing 
machines that have "nonerasing" work tapes (e.g. see 
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[Mi67] ), which is peripherally related to the research 
reported here. 



requires that the decoding scheme for an [v] ^/n-womcode 
provide a unique interpretation £or each possible, 
pattern of the n vitS/ ind:ependentr of hovr many 
generations have been written. In some cases the 
current "generation number" might also be available as 
input to the decoding scheme at no extra cost (in wits). 
WhiXe' this variation might permit some minor performance 
improvements in some instances , it remains an open 
question as to how much this additional information 
might help. 



investigations 

What if there is some restriction on the kinds 
of updates that may occur? (For example, what 
if y can replace x only if y is numerically 
greater than x?) 

What advantages :^2cre . there to representing ra : 
different number^: of: values at each generation? ^ 

What is the complexity of the decoding and 
updating algorithms for the best codes? 

How can these coding schemes be adapted to 
handle the possibility of errors occurring on 
the worn? 

If the underlying storage medium is viewed as 
storing a modulated digital signal rather than 



We note that our formulation of the problem 



A number of questions need further 
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a sequence of bits, what kind of •'womcoding*' 
should be used to allow updating yet to 
maximize bandwidth while minimizing modulation 
frequency? (See [BrSlJ , [HG691 ) . 

What can he said about the average-case 
behavlojr o£ vomcodes? . 

What if the storage elements had more than two 
possible states r and had a complicated dag 
that described the set of legal state 
transitions? 

What truly practical womcodes exist? 
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What is claimed is: 

1. Apparatus for reusing a non-erasable 
memory medium C9raprising 

means for reading successive groups of 
digits from said non-erasable memory medium, and 

5 , means responsive ta said. read, groups of 

data from said reading means and to incoming input data 
information for writing. over each successive group 
according to a predetermined plan so that said written 
over groups can be uniquely read to generate said 

10 incoming input data information. 

2. Apparatus for writing a plurality of times 
in a non-erasable memory comprising 

means for receiving incoming digital data 
and for forming sequential groups from said received 
•15 data# 

means for reading successive groups of 
■digital data from successive groups of positions in the 

non-erasable memory r 

means responsive to each said formed 
20 sequential group and to a respective read successive 
gro«^ f rom memory for generating successive new coded, . , 
grotq^^ of digital data* and.. 

means for sequentially writing into said 
memory each said coded group of data at . a said , group of 
25 positions corresponding to said respective read group of 
data# - .. * 

whereby 'said coded groups of data can be 
later read and uniquely decoded to generate said formed 
sequential groups • 

3Q 3. The apparatus of claim 2 wherein said 

non-erasable memory, prior to storage of any data, is 
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comprised in its data storag area a succession of 
"zeros" and wherein data is stored by converting one or 
more of said "zeros" to "ones" r and further wherein 

said responsive means comprises means for 
5 writing a non-decreasing number of "ones" as said groups 
of positions are written over. 

4, Apparatus for writing a plurality of times* 
at the same group of bit positions of a non-»erasable 
digital memory comprising 
XO means for reading the data stored at said 

bit positions, 

means for receiving input data to be 
retrievably stored at said bit positions # 

means responsive to said reading means 
15 and said receiving means for generating a storeable 

codeword r said codeword being greater than or equal to 
the word previously stored at said bit positions, and - 

means for writing said storeable codeword 
at said bit positions. 

20 5. The apparatus of claim 4 further 

comprising 

means connected to said reading means.for: 
uzuuguely. converting sald.% read data to^outputr^data-^r - • 

. 6. The apparatus of^ claim 4 wherein said , 
25 responsive means comprises 

* means for' generating said storeable 
codeword according to a linear womcode* 

7« The apparatus of claim 4 wherein said 
responsive means comprises 
30 means for generating said storeable 

codeword according to a tabular womcode* 
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8# The apparatus of claim 4 wherein said . ' 
r sponsive means comprises 

a read only memory element responsive to 
said reading me^ns and said receiving means for 
5 generating said storeable codeword. 

9 . Th^ .asgaxratxis of . elaiitr a whereia said read 
only memory further converts said read data to a decoded 
data word. 

10. A method for reusing a non-erasable 
10 memory medium comprising the steps of 

reading successive groups of digits from 
said non-erasable memory medium r and 

thereafter writing over each successive 
group in accordance with the data values of new incoming 
15 data information and the read groups from said memoxTT 
according to a predetermined plan, 

whereby said written over groups can be 
uniquely read to generate said incoming data 
information. 

20 11. A method for writing a plurality of times 

ia ^ nan-erasable memory comprising the steps of 

receiving input digital -da.ta; 
■ grouping said received input data into 
sequential groups of input data; 
25 reading from said memory successive 

groups of stored digital* data, 

forming successive groups of write 
digital data from said stored digital data and said 
input data sequential groups, said groups of write data 
30 being uniquely decodable to generate said input digital 
data, and 
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writing said groups of write data at 
memory positions corresponding to respective ones of 
said read groups of data in a one— to-one relationship* 

12 • The method of claim 11 further comprising 
5 the step of repeating . said receiving, grouping, reading, 
EuLAttl ng and writing: gfeggg fcnr: writing a ngrfc sequf^nc fe . of,, 
input data in said non-erasable memory. 

13. A method for witing a plurality of times 
at the same group of bit positions of a non-erasable 

10 digital memory comprising the steps of 

reading the data stored at said bit 

positions, 

receiving input data to be retrievably 
stored at said bit positions, 
15 generatina[^a storeable codeword in 

response to said ^^^fj^^ and receiving steps, said 
codeword being greater than or equal to the word 
previously stored at said bit positions, and 

writing said storeable codeword at said 

20 bit positions. 

14. The method of jclaist U further^ comprising 
thevstep of 

convertings said data read during said 
reading ^^ep to output data. 

25 15. * The method of claim 13 wherein said 

generating step comprises the step of generating - said 
storeable codeword according to a linear womcode. 

16. The method of claim 13 wherein said 
generating step comprises the step of generating said 
30 storeable codeword according to a tabular womcode. 
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17, Th method of claim 13 wherein said 
generating step comprises generating said storeable 
womcode using a read only memory element* 

18* The method of claim 17 wherein said 
generating step comprises the step of converting said 
read data to decoded, outptrt, data words.* 
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